Overview

Dataset statistics

Number of variables: 22
Number of observations: 8970
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 MiB
Average record size in memory: 156.0 B

Variable types

Numeric: 7
Categorical: 15

Alerts

Country has constant value "United States" (Constant)
OrderID has a high cardinality: 4711 distinct values (High cardinality)
OrderDate has a high cardinality: 1228 distinct values (High cardinality)
ShipDate has a high cardinality: 1322 distinct values (High cardinality)
CustomerID has a high cardinality: 792 distinct values (High cardinality)
CustomerName has a high cardinality: 792 distinct values (High cardinality)
City has a high cardinality: 519 distinct values (High cardinality)
ProductID has a high cardinality: 1847 distinct values (High cardinality)
ProductName has a high cardinality: 1835 distinct values (High cardinality)
df_index is highly correlated with RowID (High correlation)
RowID is highly correlated with df_index (High correlation)
PostalCode is highly correlated with State and 1 other field (High correlation)
Sales is highly correlated with Profit (High correlation)
Quantity is highly correlated with Country (High correlation)
Discount is highly correlated with State and 1 other field (High correlation)
Profit is highly correlated with Sales (High correlation)
ShipMode is highly correlated with Country (High correlation)
Segment is highly correlated with Country (High correlation)
Country is highly correlated with Segment and 5 other fields (High correlation)
State is highly correlated with PostalCode and 2 other fields (High correlation)
Region is highly correlated with State and 1 other field (High correlation)
Category is highly correlated with SubCategory (High correlation)
SubCategory is highly correlated with Category and 1 other field (High correlation)
df_index is uniformly distributed (Uniform)
RowID is uniformly distributed (Uniform)
OrderID is uniformly distributed (Uniform)
df_index has unique values (Unique)
RowID has unique values (Unique)
Discount has 4712 (52.5%) zeros (Zeros)
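
The percentages quoted in the alerts follow from the raw counts and the 8970 observations. As a minimal sketch (the variable names are illustrative and not part of the report), the Zeros figure for Discount and the distinct share behind the OrderID cardinality alert can be recomputed like this:

```python
# Counts copied from the report; the names are illustrative only.
n_observations = 8970
discount_zeros = 4712
order_id_distinct = 4711

# Share of rows where Discount is zero, and share of distinct OrderID values.
zeros_pct = 100 * discount_zeros / n_observations
distinct_pct = 100 * order_id_distinct / n_observations

print(f"Discount zeros: {zeros_pct:.1f}%")
print(f"OrderID distinct: {distinct_pct:.1f}%")
```

Both values round to 52.5%, which agrees with the alert text and with the Distinct (%) figure in the OrderID section.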

Reproduction

Analysis started: 2022-10-31 19:04:30.875496
Analysis finished: 2022-10-31 19:05:56.714239
Duration: 1 minute and 25.84 seconds
Software version: pandas-profiling v3.4.0
Download configuration: config.json
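
The reported duration can be checked against the two timestamps. The following sketch uses only the Python standard library; the format string is an assumption based on how the timestamps are printed in this report.

```python
from datetime import datetime

# Timestamp layout as printed in the report (assumed format string).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-10-31 19:04:30.875496", FMT)
finished = datetime.strptime("2022-10-31 19:05:56.714239", FMT)

# Split the elapsed time into whole minutes and remaining seconds.
elapsed_seconds = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed_seconds, 60)

print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```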

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE

Distinct: 8970
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4982.67068
Minimum: 0
Maximum: 9993
Zeros: 1
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 70.2 KiB

Quantile statistics

Minimum: 0
5th percentile: 508.45
Q1: 2486.25
Median: 4993.5
Q3: 7451.5
95th percentile: 9493.55
Maximum: 9993
Range: 9993
Interquartile range (IQR): 4965.25

Descriptive statistics

Standard deviation: 2875.094184
Coefficient of variation (CV): 0.5770187051
Kurtosis: -1.189201873
Mean: 4982.67068
Median Absolute Deviation (MAD): 2482
Skewness: 0.008813774477
Sum: 44694556
Variance: 8266166.565
Monotonicity: Strictly increasing
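
Several of the df_index statistics are derived from one another, so they can be cross-checked. This sketch assumes the usual definitions (IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared); the report itself does not spell them out.

```python
# Values copied from the df_index statistics above.
q1, q3 = 2486.25, 7451.5
minimum, maximum = 0, 9993
mean, std = 4982.67068, 2875.094184
total, n_observations = 44694556, 8970

iqr = q3 - q1                       # interquartile range
value_range = maximum - minimum     # range
cv = std / mean                     # coefficient of variation
variance = std ** 2                 # variance from the standard deviation
mean_from_sum = total / n_observations

print(iqr, value_range, round(cv, 7), round(variance, 1), round(mean_from_sum, 5))
```

The recomputed values match the reported IQR, range, CV, variance, and mean to the precision shown in the report.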
[Figure: histogram with fixed-size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 1 | < 0.1%
6616 | 1 | < 0.1%
6610 | 1 | < 0.1%
6611 | 1 | < 0.1%
6612 | 1 | < 0.1%
6613 | 1 | < 0.1%
6614 | 1 | < 0.1%
6615 | 1 | < 0.1%
6617 | 1 | < 0.1%
6608 | 1 | < 0.1%
Other values (8960) | 8960 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
0 | 1 | < 0.1%
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
9993 | 1 | < 0.1%
9992 | 1 | < 0.1%
9991 | 1 | < 0.1%
9990 | 1 | < 0.1%
9989 | 1 | < 0.1%
9988 | 1 | < 0.1%
9987 | 1 | < 0.1%
9986 | 1 | < 0.1%
9985 | 1 | < 0.1%
9983 | 1 | < 0.1%

RowID
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE

Distinct: 8970
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4983.67068
Minimum: 1
Maximum: 9994
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 35.2 KiB

Quantile statistics

Minimum: 1
5th percentile: 509.45
Q1: 2487.25
Median: 4994.5
Q3: 7452.5
95th percentile: 9494.55
Maximum: 9994
Range: 9993
Interquartile range (IQR): 4965.25

Descriptive statistics

Standard deviation: 2875.094184
Coefficient of variation (CV): 0.5769029232
Kurtosis: -1.189201873
Mean: 4983.67068
Median Absolute Deviation (MAD): 2482
Skewness: 0.008813774477
Sum: 44703526
Variance: 8266166.565
Monotonicity: Strictly increasing
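
The RowID statistics differ from the df_index statistics only by a shift of 1: every quantile and the mean are one higher, while the standard deviation, variance, skewness, and kurtosis are identical. That pattern is what one would expect if RowID equals df_index + 1, which is an assumption here since the report does not state it. The sketch below illustrates the property on a handful of df_index values taken from the tables above.

```python
import math
import statistics

# A few df_index values that appear in the report's tables.
df_index_sample = [0, 1, 2, 6608, 9983, 9993]
# Assumed relationship (not stated in the report): RowID = df_index + 1.
row_id_sample = [value + 1 for value in df_index_sample]

# A constant shift moves the mean by that constant...
mean_shift = statistics.mean(row_id_sample) - statistics.mean(df_index_sample)
# ...and leaves the spread unchanged.
same_spread = math.isclose(
    statistics.stdev(row_id_sample), statistics.stdev(df_index_sample)
)

print(mean_shift, same_spread)
```

Two columns related by a constant shift are perfectly correlated, which is consistent with the High correlation alert raised for this pair.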
[Figure: histogram with fixed-size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
6617 | 1 | < 0.1%
6611 | 1 | < 0.1%
6612 | 1 | < 0.1%
6613 | 1 | < 0.1%
6614 | 1 | < 0.1%
6615 | 1 | < 0.1%
6616 | 1 | < 0.1%
6618 | 1 | < 0.1%
6609 | 1 | < 0.1%
Other values (8960) | 8960 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
9994 | 1 | < 0.1%
9993 | 1 | < 0.1%
9992 | 1 | < 0.1%
9991 | 1 | < 0.1%
9990 | 1 | < 0.1%
9989 | 1 | < 0.1%
9988 | 1 | < 0.1%
9987 | 1 | < 0.1%
9986 | 1 | < 0.1%
9984 | 1 | < 0.1%

OrderID
Categorical

HIGH CARDINALITY
UNIFORM

Distinct: 4711
Distinct (%): 52.5%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
CA-2017-100111 | 13
CA-2017-157987 | 12
CA-2016-165330 | 11
US-2016-108504 | 11
US-2015-126977 | 10
Other values (4706) | 8913

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Characters and Unicode

Total characters: 125580
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2496
Unique (%): 27.8%

Sample

1st row: CA-2016-152156
2nd row: CA-2016-152156
3rd row: CA-2016-138688
4th row: US-2015-108966
5th row: US-2015-108966

Common Values

Value | Count | Frequency (%)
CA-2017-100111 | 13 | 0.1%
CA-2017-157987 | 12 | 0.1%
CA-2016-165330 | 11 | 0.1%
US-2016-108504 | 11 | 0.1%
US-2015-126977 | 10 | 0.1%
CA-2015-131338 | 10 | 0.1%
CA-2016-105732 | 9 | 0.1%
CA-2015-132626 | 9 | 0.1%
CA-2017-140949 | 9 | 0.1%
CA-2015-158421 | 9 | 0.1%
Other values (4701) | 8867 | 98.9%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
ca-2017-100111 | 13 | 0.1%
ca-2017-157987 | 12 | 0.1%
ca-2016-165330 | 11 | 0.1%
us-2016-108504 | 11 | 0.1%
us-2015-126977 | 10 | 0.1%
ca-2015-131338 | 10 | 0.1%
ca-2015-158421 | 9 | 0.1%
ca-2015-164882 | 9 | 0.1%
ca-2017-140949 | 9 | 0.1%
ca-2015-132626 | 9 | 0.1%
Other values (4701) | 8867 | 98.9%

Most occurring characters

Value | Count | Frequency (%)
1 | 22887 | 18.2%
- | 17940 | 14.3%
0 | 13890 | 11.1%
2 | 13777 | 11.0%
C | 7556 | 6.0%
A | 7556 | 6.0%
6 | 7081 | 5.6%
7 | 6682 | 5.3%
4 | 6635 | 5.3%
5 | 6604 | 5.3%
Other values (5) | 14972 | 11.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 89700 | 71.4%
Dash Punctuation | 17940 | 14.3%
Uppercase Letter | 17940 | 14.3%
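
The category names in this table correspond to Unicode general categories (Decimal Number is Nd, Dash Punctuation is Pd, Uppercase Letter is Lu). As a small sketch using Python's standard unicodedata module (not necessarily how the profiler computes it), the per-character categories of one OrderID from the report can be counted like this:

```python
import unicodedata
from collections import Counter

# One OrderID value taken from the report.
order_id = "CA-2017-100111"

# Map each character to its Unicode general category and count them.
category_counts = Counter(unicodedata.category(ch) for ch in order_id)

print(dict(category_counts))
```

Each 14-character OrderID of this shape has 10 digits, 2 dashes, and 2 uppercase letters; multiplied by 8970 rows this gives the 89700, 17940, and 17940 totals shown above.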

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 22887 | 25.5%
0 | 13890 | 15.5%
2 | 13777 | 15.4%
6 | 7081 | 7.9%
7 | 6682 | 7.4%
4 | 6635 | 7.4%
5 | 6604 | 7.4%
3 | 4905 | 5.5%
8 | 3673 | 4.1%
9 | 3566 | 4.0%

Uppercase Letter
Value | Count | Frequency (%)
C | 7556 | 42.1%
A | 7556 | 42.1%
U | 1414 | 7.9%
S | 1414 | 7.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 17940 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 107640 | 85.7%
Latin | 17940 | 14.3%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 22887 | 21.3%
- | 17940 | 16.7%
0 | 13890 | 12.9%
2 | 13777 | 12.8%
6 | 7081 | 6.6%
7 | 6682 | 6.2%
4 | 6635 | 6.2%
5 | 6604 | 6.1%
3 | 4905 | 4.6%
8 | 3673 | 3.4%

Latin
Value | Count | Frequency (%)
C | 7556 | 42.1%
A | 7556 | 42.1%
U | 1414 | 7.9%
S | 1414 | 7.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 125580 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 22887 | 18.2%
- | 17940 | 14.3%
0 | 13890 | 11.1%
2 | 13777 | 11.0%
C | 7556 | 6.0%
A | 7556 | 6.0%
6 | 7081 | 5.6%
7 | 6682 | 5.3%
4 | 6635 | 5.3%
5 | 6604 | 5.3%
Other values (5) | 14972 | 11.9%

OrderDate
Categorical

HIGH CARDINALITY

Distinct: 1228
Distinct (%): 13.7%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
9/2/2017 | 35
11/10/2016 | 34
9/5/2016 | 32
12/1/2017 | 31
12/8/2017 | 29
Other values (1223) | 8809

Length

Max length: 10
Median length: 9
Mean length: 9.065663322
Min length: 8

Characters and Unicode

Total characters: 81319
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 138
Unique (%): 1.5%

Sample

1st row: 11/8/2016
2nd row: 11/8/2016
3rd row: 6/12/2016
4th row: 10/11/2015
5th row: 10/11/2015
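
OrderDate is profiled as a categorical variable because the dates are stored as text. Values such as 11/24/2016 elsewhere in this section suggest a month/day/year layout, so the sketch below assumes that format; it uses only the standard library and is not part of the report.

```python
from datetime import datetime

# Date strings taken from the OrderDate section of the report.
samples = ["11/8/2016", "6/12/2016", "10/11/2015", "11/24/2016"]

# Assumed layout: month/day/year without zero padding.
parsed = [datetime.strptime(text, "%m/%d/%Y").date() for text in samples]

for text, value in zip(samples, parsed):
    print(text, "->", value.isoformat())
```

Once parsed, the column would be treated as a date rather than as 1228 distinct text categories.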

Common Values

Value | Count | Frequency (%)
9/2/2017 | 35 | 0.4%
11/10/2016 | 34 | 0.4%
9/5/2016 | 32 | 0.4%
12/1/2017 | 31 | 0.3%
12/8/2017 | 29 | 0.3%
12/9/2017 | 29 | 0.3%
12/11/2016 | 28 | 0.3%
11/12/2017 | 28 | 0.3%
11/24/2016 | 27 | 0.3%
12/2/2017 | 27 | 0.3%
Other values (1218) | 8670 | 96.7%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
9/2/2017 | 35 | 0.4%
11/10/2016 | 34 | 0.4%
9/5/2016 | 32 | 0.4%
12/1/2017 | 31 | 0.3%
12/8/2017 | 29 | 0.3%
12/9/2017 | 29 | 0.3%
12/11/2016 | 28 | 0.3%
11/12/2017 | 28 | 0.3%
12/2/2017 | 27 | 0.3%
11/24/2016 | 27 | 0.3%
Other values (1218) | 8670 | 96.7%

Most occurring characters

Value | Count | Frequency (%)
1 | 17993 | 22.1%
/ | 17940 | 22.1%
2 | 14311 | 17.6%
0 | 10601 | 13.0%
7 | 4448 | 5.5%
6 | 3795 | 4.7%
5 | 3398 | 4.2%
4 | 3235 | 4.0%
9 | 2080 | 2.6%
3 | 2017 | 2.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 63379 | 77.9%
Other Punctuation | 17940 | 22.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 17993 | 28.4%
2 | 14311 | 22.6%
0 | 10601 | 16.7%
7 | 4448 | 7.0%
6 | 3795 | 6.0%
5 | 3398 | 5.4%
4 | 3235 | 5.1%
9 | 2080 | 3.3%
3 | 2017 | 3.2%
8 | 1501 | 2.4%

Other Punctuation
Value | Count | Frequency (%)
/ | 17940 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 81319 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 17993 | 22.1%
/ | 17940 | 22.1%
2 | 14311 | 17.6%
0 | 10601 | 13.0%
7 | 4448 | 5.5%
6 | 3795 | 4.7%
5 | 3398 | 4.2%
4 | 3235 | 4.0%
9 | 2080 | 2.6%
3 | 2017 | 2.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 81319 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 17993 | 22.1%
/ | 17940 | 22.1%
2 | 14311 | 17.6%
0 | 10601 | 13.0%
7 | 4448 | 5.5%
6 | 3795 | 4.7%
5 | 3398 | 4.2%
4 | 3235 | 4.0%
9 | 2080 | 2.6%
3 | 2017 | 2.5%

ShipDate
Categorical

HIGH CARDINALITY

Distinct: 1322
Distinct (%): 14.7%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
12/16/2015 | 34
9/26/2017 | 32
9/6/2017 | 30
12/12/2017 | 29
11/21/2017 | 29
Other values (1317) | 8816

Length

Max length: 10
Median length: 9
Mean length: 9.07335563
Min length: 8

Characters and Unicode

Total characters: 81388
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 141
Unique (%): 1.6%

Sample

1st row: 11/11/2016
2nd row: 11/11/2016
3rd row: 6/16/2016
4th row: 10/18/2015
5th row: 10/18/2015

Common Values

Value | Count | Frequency (%)
12/16/2015 | 34 | 0.4%
9/26/2017 | 32 | 0.4%
9/6/2017 | 30 | 0.3%
12/12/2017 | 29 | 0.3%
11/21/2017 | 29 | 0.3%
12/6/2017 | 27 | 0.3%
9/15/2017 | 25 | 0.3%
9/13/2014 | 25 | 0.3%
11/16/2017 | 24 | 0.3%
9/8/2017 | 24 | 0.3%
Other values (1312) | 8691 | 96.9%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
12/16/2015 | 34 | 0.4%
9/26/2017 | 32 | 0.4%
9/6/2017 | 30 | 0.3%
12/12/2017 | 29 | 0.3%
11/21/2017 | 29 | 0.3%
12/6/2017 | 27 | 0.3%
9/15/2017 | 25 | 0.3%
9/13/2014 | 25 | 0.3%
9/26/2015 | 24 | 0.3%
9/8/2017 | 24 | 0.3%
Other values (1312) | 8691 | 96.9%

Most occurring characters

Value | Count | Frequency (%)
1 | 17943 | 22.0%
/ | 17940 | 22.0%
2 | 14399 | 17.7%
0 | 10508 | 12.9%
7 | 4489 | 5.5%
6 | 3959 | 4.9%
5 | 3459 | 4.3%
4 | 3148 | 3.9%
9 | 2039 | 2.5%
3 | 1934 | 2.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 63448 | 78.0%
Other Punctuation | 17940 | 22.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 17943 | 28.3%
2 | 14399 | 22.7%
0 | 10508 | 16.6%
7 | 4489 | 7.1%
6 | 3959 | 6.2%
5 | 3459 | 5.5%
4 | 3148 | 5.0%
9 | 2039 | 3.2%
3 | 1934 | 3.0%
8 | 1570 | 2.5%

Other Punctuation
Value | Count | Frequency (%)
/ | 17940 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 81388 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 17943 | 22.0%
/ | 17940 | 22.0%
2 | 14399 | 17.7%
0 | 10508 | 12.9%
7 | 4489 | 5.5%
6 | 3959 | 4.9%
5 | 3459 | 4.3%
4 | 3148 | 3.9%
9 | 2039 | 2.5%
3 | 1934 | 2.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 81388 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 17943 | 22.0%
/ | 17940 | 22.0%
2 | 14399 | 17.7%
0 | 10508 | 12.9%
7 | 4489 | 5.5%
6 | 3959 | 4.9%
5 | 3459 | 4.3%
4 | 3148 | 3.9%
9 | 2039 | 2.5%
3 | 1934 | 2.4%

ShipMode
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
Standard Class | 5323
Second Class | 1778
First Class | 1375
Same Day | 494

Length

Max length: 14
Median length: 14
Mean length: 12.81326644
Min length: 8

Characters and Unicode

Total characters: 114935
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Second Class
2nd row: Second Class
3rd row: Second Class
4th row: Standard Class
5th row: Standard Class

Common Values

Value | Count | Frequency (%)
Standard Class | 5323 | 59.3%
Second Class | 1778 | 19.8%
First Class | 1375 | 15.3%
Same Day | 494 | 5.5%

Length

[Figure: histogram of lengths of the category]

[Figure: category frequency plot]

Words

Value | Count | Frequency (%)
class | 8476 | 47.2%
standard | 5323 | 29.7%
second | 1778 | 9.9%
first | 1375 | 7.7%
same | 494 | 2.8%
day | 494 | 2.8%

Most occurring characters

Value | Count | Frequency (%)
a | 20110 | 17.5%
s | 18327 | 15.9%
d | 12424 | 10.8%
(space) | 8970 | 7.8%
l | 8476 | 7.4%
C | 8476 | 7.4%
S | 7595 | 6.6%
n | 7101 | 6.2%
r | 6698 | 5.8%
t | 6698 | 5.8%
Other values (8) | 10060 | 8.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 88025 | 76.6%
Uppercase Letter | 17940 | 15.6%
Space Separator | 8970 | 7.8%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 20110 | 22.8%
s | 18327 | 20.8%
d | 12424 | 14.1%
l | 8476 | 9.6%
n | 7101 | 8.1%
r | 6698 | 7.6%
t | 6698 | 7.6%
e | 2272 | 2.6%
c | 1778 | 2.0%
o | 1778 | 2.0%
Other values (3) | 2363 | 2.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 8476 | 47.2%
S | 7595 | 42.3%
F | 1375 | 7.7%
D | 494 | 2.8%

Space Separator
Value | Count | Frequency (%)
(space) | 8970 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 105965 | 92.2%
Common | 8970 | 7.8%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 20110 | 19.0%
s | 18327 | 17.3%
d | 12424 | 11.7%
l | 8476 | 8.0%
C | 8476 | 8.0%
S | 7595 | 7.2%
n | 7101 | 6.7%
r | 6698 | 6.3%
t | 6698 | 6.3%
e | 2272 | 2.1%
Other values (7) | 7788 | 7.3%

Common
Value | Count | Frequency (%)
(space) | 8970 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 114935 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
a | 20110 | 17.5%
s | 18327 | 15.9%
d | 12424 | 10.8%
(space) | 8970 | 7.8%
l | 8476 | 7.4%
C | 8476 | 7.4%
S | 7595 | 6.6%
n | 7101 | 6.2%
r | 6698 | 5.8%
t | 6698 | 5.8%
Other values (8) | 10060 | 8.8%

CustomerID
Categorical

HIGH CARDINALITY

Distinct: 792
Distinct (%): 8.8%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
JL-15835 | 33
PP-18955 | 32
MA-17560 | 32
WB-21850 | 32
SV-20365 | 31
Other values (787) | 8810

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Characters and Unicode

Total characters: 71760
Distinct characters: 40
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 9
Unique (%): 0.1%

Sample

1st row: CG-12520
2nd row: CG-12520
3rd row: DV-13045
4th row: SO-20335
5th row: SO-20335

Common Values

Value | Count | Frequency (%)
JL-15835 | 33 | 0.4%
PP-18955 | 32 | 0.4%
MA-17560 | 32 | 0.4%
WB-21850 | 32 | 0.4%
SV-20365 | 31 | 0.3%
EH-13765 | 31 | 0.3%
AP-10915 | 30 | 0.3%
CK-12205 | 29 | 0.3%
JD-15895 | 29 | 0.3%
CS-12250 | 28 | 0.3%
Other values (782) | 8663 | 96.6%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
jl-15835 | 33 | 0.4%
ma-17560 | 32 | 0.4%
wb-21850 | 32 | 0.4%
pp-18955 | 32 | 0.4%
sv-20365 | 31 | 0.3%
eh-13765 | 31 | 0.3%
ap-10915 | 30 | 0.3%
ck-12205 | 29 | 0.3%
jd-15895 | 29 | 0.3%
cl-12565 | 28 | 0.3%
Other values (782) | 8663 | 96.6%

Most occurring characters

Value | Count | Frequency (%)
1 | 10721 | 14.9%
- | 8970 | 12.5%
0 | 7600 | 10.6%
5 | 7100 | 9.9%
2 | 4186 | 5.8%
6 | 2613 | 3.6%
7 | 2609 | 3.6%
9 | 2588 | 3.6%
8 | 2548 | 3.6%
3 | 2524 | 3.5%
Other values (30) | 20301 | 28.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 44850 | 62.5%
Uppercase Letter | 17904 | 24.9%
Dash Punctuation | 8970 | 12.5%
Lowercase Letter | 36 | 0.1%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
S | 1572 | 8.8%
M | 1534 | 8.6%
C | 1534 | 8.6%
B | 1482 | 8.3%
D | 1191 | 6.7%
A | 1111 | 6.2%
J | 1019 | 5.7%
P | 1000 | 5.6%
H | 870 | 4.9%
K | 836 | 4.7%
Other values (16) | 5755 | 32.1%

Decimal Number
Value | Count | Frequency (%)
1 | 10721 | 23.9%
0 | 7600 | 16.9%
5 | 7100 | 15.8%
2 | 4186 | 9.3%
6 | 2613 | 5.8%
7 | 2609 | 5.8%
9 | 2588 | 5.8%
8 | 2548 | 5.7%
3 | 2524 | 5.6%
4 | 2361 | 5.3%

Lowercase Letter
Value | Count | Frequency (%)
p | 24 | 66.7%
o | 7 | 19.4%
l | 5 | 13.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 8970 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 53820 | 75.0%
Latin | 17940 | 25.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
S | 1572 | 8.8%
M | 1534 | 8.6%
C | 1534 | 8.6%
B | 1482 | 8.3%
D | 1191 | 6.6%
A | 1111 | 6.2%
J | 1019 | 5.7%
P | 1000 | 5.6%
H | 870 | 4.8%
K | 836 | 4.7%
Other values (19) | 5791 | 32.3%

Common
Value | Count | Frequency (%)
1 | 10721 | 19.9%
- | 8970 | 16.7%
0 | 7600 | 14.1%
5 | 7100 | 13.2%
2 | 4186 | 7.8%
6 | 2613 | 4.9%
7 | 2609 | 4.8%
9 | 2588 | 4.8%
8 | 2548 | 4.7%
3 | 2524 | 4.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 71760 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 10721 | 14.9%
- | 8970 | 12.5%
0 | 7600 | 10.6%
5 | 7100 | 9.9%
2 | 4186 | 5.8%
6 | 2613 | 3.6%
7 | 2609 | 3.6%
9 | 2588 | 3.6%
8 | 2548 | 3.6%
3 | 2524 | 3.5%
Other values (30) | 20301 | 28.3%

CustomerName
Categorical

HIGH CARDINALITY

Distinct: 792
Distinct (%): 8.8%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
John Lee | 33
Paul Prost | 32
Matt Abelman | 32
William Brown | 32
Seth Vernon | 31
Other values (787) | 8810

Length

Max length: 22
Median length: 18
Mean length: 12.9509476
Min length: 7

Characters and Unicode

Total characters: 116170
Distinct characters: 57
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 9
Unique (%): 0.1%

Sample

1st row: Claire Gute
2nd row: Claire Gute
3rd row: Darrin Van Huff
4th row: Sean O'Donnell
5th row: Sean O'Donnell

Common Values

Value | Count | Frequency (%)
John Lee | 33 | 0.4%
Paul Prost | 32 | 0.4%
Matt Abelman | 32 | 0.4%
William Brown | 32 | 0.4%
Seth Vernon | 31 | 0.3%
Edward Hooks | 31 | 0.3%
Arthur Prichep | 30 | 0.3%
Chloris Kastensmidt | 29 | 0.3%
Jonathan Doherty | 29 | 0.3%
Chris Selesnick | 28 | 0.3%
Other values (782) | 8663 | 96.6%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
john | 101 | 0.6%
michael | 98 | 0.5%
frank | 98 | 0.5%
patrick | 89 | 0.5%
brian | 88 | 0.5%
rick | 87 | 0.5%
paul | 86 | 0.5%
ken | 81 | 0.5%
stewart | 80 | 0.4%
brown | 76 | 0.4%
Other values (899) | 17114 | 95.1%

Most occurring characters

Value | Count | Frequency (%)
a | 10723 | 9.2%
e | 10670 | 9.2%
n | 9248 | 8.0%
(space) | 9028 | 7.8%
r | 8512 | 7.3%
i | 7078 | 6.1%
l | 5824 | 5.0%
o | 5223 | 4.5%
t | 4858 | 4.2%
s | 4073 | 3.5%
Other values (47) | 40933 | 35.2%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 88617 | 76.3%
Uppercase Letter | 18383 | 15.8%
Space Separator | 9028 | 7.8%
Other Punctuation | 115 | 0.1%
Dash Punctuation | 27 | < 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 10723 | 12.1%
e | 10670 | 12.0%
n | 9248 | 10.4%
r | 8512 | 9.6%
i | 7078 | 8.0%
l | 5824 | 6.6%
o | 5223 | 5.9%
t | 4858 | 5.5%
s | 4073 | 4.6%
h | 3454 | 3.9%
Other values (18) | 18954 | 21.4%

Uppercase Letter
Value | Count | Frequency (%)
C | 1632 | 8.9%
S | 1572 | 8.6%
M | 1570 | 8.5%
B | 1530 | 8.3%
D | 1220 | 6.6%
A | 1162 | 6.3%
J | 1019 | 5.5%
P | 1000 | 5.4%
H | 903 | 4.9%
K | 867 | 4.7%
Other values (16) | 5908 | 32.1%

Space Separator
Value | Count | Frequency (%)
(space) | 9028 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
' | 115 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 27 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 107000 | 92.1%
Common | 9170 | 7.9%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 10723 | 10.0%
e | 10670 | 10.0%
n | 9248 | 8.6%
r | 8512 | 8.0%
i | 7078 | 6.6%
l | 5824 | 5.4%
o | 5223 | 4.9%
t | 4858 | 4.5%
s | 4073 | 3.8%
h | 3454 | 3.2%
Other values (44) | 37337 | 34.9%

Common
Value | Count | Frequency (%)
(space) | 9028 | 98.5%
' | 115 | 1.3%
- | 27 | 0.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 116094 | 99.9%
None | 76 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
a | 10723 | 9.2%
e | 10670 | 9.2%
n | 9248 | 8.0%
(space) | 9028 | 7.8%
r | 8512 | 7.3%
i | 7078 | 6.1%
l | 5824 | 5.0%
o | 5223 | 4.5%
t | 4858 | 4.2%
s | 4073 | 3.5%
Other values (44) | 40857 | 35.2%

None
Value | Count | Frequency (%)
ö | 53 | 69.7%
ä | 18 | 23.7%
ü | 5 | 6.6%
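
The three characters listed under the block labelled None are accented Latin letters. In the Unicode Standard their code points fall in the Latin-1 Supplement block (U+0080 to U+00FF); why the report shows None instead of a block name is not stated, so treat any explanation as a guess. A short standard-library sketch that looks up their code points and names:

```python
import unicodedata

# Characters reported under the "None" block in the CustomerName section.
non_ascii_chars = "öäü"

rows = []
for ch in non_ascii_chars:
    code_point = ord(ch)
    # Latin-1 Supplement spans U+0080..U+00FF in the Unicode Standard.
    in_latin1_supplement = 0x0080 <= code_point <= 0x00FF
    rows.append((f"U+{code_point:04X}", unicodedata.name(ch), in_latin1_supplement))

for row in rows:
    print(row)
```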

Segment
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
Consumer | 4658
Corporate | 2708
Home Office | 1604

Length

Max length: 11
Median length: 8
Mean length: 8.838350056
Min length: 8

Characters and Unicode

Total characters: 79280
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Consumer
2nd row: Consumer
3rd row: Corporate
4th row: Consumer
5th row: Consumer

Common Values

Value | Count | Frequency (%)
Consumer | 4658 | 51.9%
Corporate | 2708 | 30.2%
Home Office | 1604 | 17.9%

Length

[Figure: histogram of lengths of the category]

[Figure: category frequency plot]

Words

Value | Count | Frequency (%)
consumer | 4658 | 44.1%
corporate | 2708 | 25.6%
home | 1604 | 15.2%
office | 1604 | 15.2%

Most occurring characters

Value | Count | Frequency (%)
o | 11678 | 14.7%
e | 10574 | 13.3%
r | 10074 | 12.7%
C | 7366 | 9.3%
m | 6262 | 7.9%
n | 4658 | 5.9%
s | 4658 | 5.9%
u | 4658 | 5.9%
f | 3208 | 4.0%
t | 2708 | 3.4%
Other values (7) | 13436 | 16.9%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 67102 | 84.6%
Uppercase Letter | 10574 | 13.3%
Space Separator | 1604 | 2.0%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
o | 11678 | 17.4%
e | 10574 | 15.8%
r | 10074 | 15.0%
m | 6262 | 9.3%
n | 4658 | 6.9%
s | 4658 | 6.9%
u | 4658 | 6.9%
f | 3208 | 4.8%
t | 2708 | 4.0%
p | 2708 | 4.0%
Other values (3) | 5916 | 8.8%

Uppercase Letter
Value | Count | Frequency (%)
C | 7366 | 69.7%
H | 1604 | 15.2%
O | 1604 | 15.2%

Space Separator
Value | Count | Frequency (%)
(space) | 1604 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 77676 | 98.0%
Common | 1604 | 2.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
o | 11678 | 15.0%
e | 10574 | 13.6%
r | 10074 | 13.0%
C | 7366 | 9.5%
m | 6262 | 8.1%
n | 4658 | 6.0%
s | 4658 | 6.0%
u | 4658 | 6.0%
f | 3208 | 4.1%
t | 2708 | 3.5%
Other values (6) | 11832 | 15.2%

Common
Value | Count | Frequency (%)
(space) | 1604 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 79280 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
o | 11678 | 14.7%
e | 10574 | 13.3%
r | 10074 | 12.7%
C | 7366 | 9.3%
m | 6262 | 7.9%
n | 4658 | 5.9%
s | 4658 | 5.9%
u | 4658 | 5.9%
f | 3208 | 4.0%
t | 2708 | 3.4%
Other values (7) | 13436 | 16.9%

Country
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
United States | 8970

Length

Max length: 13
Median length: 13
Mean length: 13
Min length: 13

Characters and Unicode

Total characters: 116610
Distinct characters: 10
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: United States
2nd row: United States
3rd row: United States
4th row: United States
5th row: United States

Common Values

Value | Count | Frequency (%)
United States | 8970 | 100.0%

Length

[Figure: histogram of lengths of the category]

[Figure: category frequency plot]

Words

Value | Count | Frequency (%)
united | 8970 | 50.0%
states | 8970 | 50.0%

Most occurring characters

Value | Count | Frequency (%)
t | 26910 | 23.1%
e | 17940 | 15.4%
U | 8970 | 7.7%
n | 8970 | 7.7%
i | 8970 | 7.7%
d | 8970 | 7.7%
(space) | 8970 | 7.7%
S | 8970 | 7.7%
a | 8970 | 7.7%
s | 8970 | 7.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 89700 | 76.9%
Uppercase Letter | 17940 | 15.4%
Space Separator | 8970 | 7.7%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
t | 26910 | 30.0%
e | 17940 | 20.0%
n | 8970 | 10.0%
i | 8970 | 10.0%
d | 8970 | 10.0%
a | 8970 | 10.0%
s | 8970 | 10.0%

Uppercase Letter
Value | Count | Frequency (%)
U | 8970 | 50.0%
S | 8970 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 8970 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 107640 | 92.3%
Common | 8970 | 7.7%

Most frequent character per script

Latin
Value | Count | Frequency (%)
t | 26910 | 25.0%
e | 17940 | 16.7%
U | 8970 | 8.3%
n | 8970 | 8.3%
i | 8970 | 8.3%
d | 8970 | 8.3%
S | 8970 | 8.3%
a | 8970 | 8.3%
s | 8970 | 8.3%

Common
Value | Count | Frequency (%)
(space) | 8970 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 116610 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
t | 26910 | 23.1%
e | 17940 | 15.4%
U | 8970 | 7.7%
n | 8970 | 7.7%
i | 8970 | 7.7%
d | 8970 | 7.7%
(space) | 8970 | 7.7%
S | 8970 | 7.7%
a | 8970 | 7.7%
s | 8970 | 7.7%

City
Categorical

HIGH CARDINALITY

Distinct: 519
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
New York City | 896
Los Angeles | 736
San Francisco | 498
Philadelphia | 433
Seattle | 419
Other values (514) | 5988

Length

Max length: 17
Median length: 15
Mean length: 9.423411371
Min length: 4

Characters and Unicode

Total characters: 84528
Distinct characters: 51
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 70
Unique (%): 0.8%

Sample

1st row: Henderson
2nd row: Henderson
3rd row: Los Angeles
4th row: Fort Lauderdale
5th row: Fort Lauderdale

Common Values

Value | Count | Frequency (%)
New York City | 896 | 10.0%
Los Angeles | 736 | 8.2%
San Francisco | 498 | 5.6%
Philadelphia | 433 | 4.8%
Seattle | 419 | 4.7%
Houston | 259 | 2.9%
Chicago | 212 | 2.4%
Columbus | 199 | 2.2%
San Diego | 164 | 1.8%
Springfield | 147 | 1.6%
Other values (509) | 5007 | 55.8%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
city | 969 | 7.4%
new | 917 | 7.0%
york | 899 | 6.9%
san | 767 | 5.9%
los | 736 | 5.6%
angeles | 736 | 5.6%
francisco | 498 | 3.8%
philadelphia | 433 | 3.3%
seattle | 419 | 3.2%
houston | 259 | 2.0%
Other values (544) | 6411 | 49.1%

Most occurring characters

Value | Count | Frequency (%)
e | 8139 | 9.6%
a | 6736 | 8.0%
o | 6648 | 7.9%
n | 5660 | 6.7%
i | 5534 | 6.5%
l | 5294 | 6.3%
s | 4290 | 5.1%
r | 4090 | 4.8%
(space) | 4074 | 4.8%
t | 4064 | 4.8%
Other values (41) | 29999 | 35.5%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 67410 | 79.7%
Uppercase Letter | 13044 | 15.4%
Space Separator | 4074 | 4.8%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 8139 | 12.1%
a | 6736 | 10.0%
o | 6648 | 9.9%
n | 5660 | 8.4%
i | 5534 | 8.2%
l | 5294 | 7.9%
s | 4290 | 6.4%
r | 4090 | 6.1%
t | 4064 | 6.0%
c | 2166 | 3.2%
Other values (16) | 14789 | 21.9%

Uppercase Letter
Value | Count | Frequency (%)
C | 1871 | 14.3%
S | 1643 | 12.6%
L | 1231 | 9.4%
A | 1157 | 8.9%
N | 1104 | 8.5%
Y | 917 | 7.0%
P | 831 | 6.4%
F | 744 | 5.7%
D | 554 | 4.2%
H | 477 | 3.7%
Other values (14) | 2515 | 19.3%

Space Separator
Value | Count | Frequency (%)
(space) | 4074 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 80454 | 95.2%
Common | 4074 | 4.8%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 8139 | 10.1%
a | 6736 | 8.4%
o | 6648 | 8.3%
n | 5660 | 7.0%
i | 5534 | 6.9%
l | 5294 | 6.6%
s | 4290 | 5.3%
r | 4090 | 5.1%
t | 4064 | 5.1%
c | 2166 | 2.7%
Other values (40) | 27833 | 34.6%

Common
Value | Count | Frequency (%)
(space) | 4074 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 84528 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
e | 8139 | 9.6%
a | 6736 | 8.0%
o | 6648 | 7.9%
n | 5660 | 6.7%
i | 5534 | 6.5%
l | 5294 | 6.3%
s | 4290 | 5.1%
r | 4090 | 4.8%
(space) | 4074 | 4.8%
t | 4064 | 4.8%
Other values (41) | 29999 | 35.5%

State
Categorical

HIGH CORRELATION

Distinct: 49
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
California | 1963
New York | 1103
Texas | 695
Washington | 495
Pennsylvania | 471
Other values (44) | 4243

Length

Max length: 20
Median length: 14
Mean length: 8.599442586
Min length: 4

Characters and Unicode

Total characters: 77137
Distinct characters: 46
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: Kentucky
2nd row: Kentucky
3rd row: California
4th row: Florida
5th row: Florida

Common Values

Value | Count | Frequency (%)
California | 1963 | 21.9%
New York | 1103 | 12.3%
Texas | 695 | 7.7%
Washington | 495 | 5.5%
Pennsylvania | 471 | 5.3%
Ohio | 383 | 4.3%
Illinois | 331 | 3.7%
Florida | 312 | 3.5%
Michigan | 252 | 2.8%
Virginia | 217 | 2.4%
Other values (39) | 2748 | 30.6%

Length

[Figure: histogram of lengths of the category]

Words

Value | Count | Frequency (%)
california | 1963 | 18.5%
new | 1296 | 12.2%
york | 1103 | 10.4%
texas | 695 | 6.5%
washington | 495 | 4.7%
pennsylvania | 471 | 4.4%
ohio | 383 | 3.6%
illinois | 331 | 3.1%
florida | 312 | 2.9%
michigan | 252 | 2.4%
Other values (43) | 3311 | 31.2%

Most occurring characters

Value | Count | Frequency (%)
a | 9858 | 12.8%
i | 9077 | 11.8%
n | 7314 | 9.5%
o | 6638 | 8.6%
r | 5200 | 6.7%
e | 4456 | 5.8%
l | 4186 | 5.4%
s | 3939 | 5.1%
C | 2441 | 3.2%
f | 1973 | 2.6%
Other values (36) | 22055 | 28.6%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 64893 | 84.1%
Uppercase Letter | 10602 | 13.7%
Space Separator | 1642 | 2.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 9858 | 15.2%
i | 9077 | 14.0%
n | 7314 | 11.3%
o | 6638 | 10.2%
r | 5200 | 8.0%
e | 4456 | 6.9%
l | 4186 | 6.5%
s | 3939 | 6.1%
f | 1973 | 3.0%
h | 1748 | 2.7%
Other values (14) | 10504 | 16.2%

Uppercase Letter
Value | Count | Frequency (%)
C | 2441 | 23.0%
N | 1582 | 14.9%
Y | 1103 | 10.4%
T | 848 | 8.0%
M | 750 | 7.1%
W | 608 | 5.7%
I | 583 | 5.5%
O | 549 | 5.2%
P | 471 | 4.4%
F | 312 | 2.9%
Other values (11) | 1355 | 12.8%

Space Separator
Value | Count | Frequency (%)
(space) | 1642 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 75495 | 97.9%
Common | 1642 | 2.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 9858 | 13.1%
i | 9077 | 12.0%
n | 7314 | 9.7%
o | 6638 | 8.8%
r | 5200 | 6.9%
e | 4456 | 5.9%
l | 4186 | 5.5%
s | 3939 | 5.2%
C | 2441 | 3.2%
f | 1973 | 2.6%
Other values (35) | 20413 | 27.0%

Common
Value | Count | Frequency (%)
(space) | 1642 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 77137 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
a | 9858 | 12.8%
i | 9077 | 11.8%
n | 7314 | 9.5%
o | 6638 | 8.6%
r | 5200 | 6.7%
e | 4456 | 5.8%
l | 4186 | 5.4%
s | 3939 | 5.1%
C | 2441 | 3.2%
f | 1973 | 2.6%
Other values (36) | 22055 | 28.6%

PostalCode
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 617
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 54959.33489
Minimum: 1040
Maximum: 99301
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 70.2 KiB

Quantile statistics

Minimum: 1040
5th percentile: 10009
Q1: 22153
Median: 54601
Q3: 90032
95th percentile: 98103
Maximum: 99301
Range: 98261
Interquartile range (IQR): 67879

Descriptive statistics

Standard deviation: 32760.86894
Coefficient of variation (CV): 0.5960928931
Kurtosis: -1.531868321
Mean: 54959.33489
Median Absolute Deviation (MAD): 35403
Skewness: -0.1081174326
Sum: 492985234
Variance: 1073274534
Monotonicity: Not monotonic
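
PostalCode is profiled as a real number, and its minimum of 1040 has only four digits. If these are five-digit US ZIP codes (an assumption, based on Country being constant at "United States"), a leading zero was dropped when the column was read as numeric, and statistics such as the mean carry little meaning for what is really an identifier. The hypothetical helper below sketches how the codes could be restored to zero-padded text:

```python
def format_zip(code: int) -> str:
    """Zero-pad a numeric postal code to five digits (assumed ZIP format)."""
    return f"{code:05d}"


# Minimum, 5th percentile, and maximum from the PostalCode statistics.
padded = [format_zip(code) for code in (1040, 10009, 99301)]
print(padded)
```

Only codes shorter than five digits change; the others pass through unchanged.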
[Figure: histogram with fixed-size bins (bins=50)]

Common values

Value | Count | Frequency (%)
10035 | 255 | 2.8%
10009 | 227 | 2.5%
10024 | 227 | 2.5%
94122 | 199 | 2.2%
10011 | 187 | 2.1%
98105 | 163 | 1.8%
94110 | 161 | 1.8%
90049 | 148 | 1.6%
98103 | 148 | 1.6%
94109 | 138 | 1.5%
Other values (607) | 7117 | 79.3%

Minimum 10 values

Value | Count | Frequency (%)
1040 | 1 | < 0.1%
1453 | 6 | 0.1%
1752 | 2 | < 0.1%
1810 | 4 | < 0.1%
1841 | 32 | 0.4%
1852 | 16 | 0.2%
1915 | 3 | < 0.1%
2038 | 17 | 0.2%
2138 | 6 | 0.1%
2148 | 3 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
99301 | 6 | 0.1%
99207 | 7 | 0.1%
98661 | 5 | 0.1%
98632 | 3 | < 0.1%
98502 | 5 | 0.1%
98270 | 1 | < 0.1%
98226 | 2 | < 0.1%
98208 | 1 | < 0.1%
98198 | 7 | 0.1%
98115 | 108 | 1.2%

Region
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 70.2 KiB

Value | Count
West | 3043
East | 2611
Central | 1857
South | 1459

Length

Max length: 7
Median length: 4
Mean length: 4.783723523
Min length: 4

Characters and Unicode

Total characters: 42910
Distinct characters: 14
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: South
2nd row: South
3rd row: West
4th row: South
5th row: South

Common Values

Value | Count | Frequency (%)
West | 3043 | 33.9%
East | 2611 | 29.1%
Central | 1857 | 20.7%
South | 1459 | 16.3%

Length

[Figure: histogram of lengths of the category]

[Figure: category frequency plot]

Words

Value | Count | Frequency (%)
west | 3043 | 33.9%
east | 2611 | 29.1%
central | 1857 | 20.7%
south | 1459 | 16.3%

Most occurring characters

Value | Count | Frequency (%)
t | 8970 | 20.9%
s | 5654 | 13.2%
e | 4900 | 11.4%
a | 4468 | 10.4%
W | 3043 | 7.1%
E | 2611 | 6.1%
C | 1857 | 4.3%
n | 1857 | 4.3%
r | 1857 | 4.3%
l | 1857 | 4.3%
Other values (4) | 5836 | 13.6%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 33940 | 79.1%
Uppercase Letter | 8970 | 20.9%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
t | 8970 | 26.4%
s | 5654 | 16.7%
e | 4900 | 14.4%
a | 4468 | 13.2%
n | 1857 | 5.5%
r | 1857 | 5.5%
l | 1857 | 5.5%
o | 1459 | 4.3%
u | 1459 | 4.3%
h | 1459 | 4.3%

Uppercase Letter
Value | Count | Frequency (%)
W | 3043 | 33.9%
E | 2611 | 29.1%
C | 1857 | 20.7%
S | 1459 | 16.3%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 42910 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
t | 8970 | 20.9%
s | 5654 | 13.2%
e | 4900 | 11.4%
a | 4468 | 10.4%
W | 3043 | 7.1%
E | 2611 | 6.1%
C | 1857 | 4.3%
n | 1857 | 4.3%
r | 1857 | 4.3%
l | 1857 | 4.3%
Other values (4) | 5836 | 13.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 42910 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
t | 8970 | 20.9%
s | 5654 | 13.2%
e | 4900 | 11.4%
a | 4468 | 10.4%
W | 3043 | 7.1%
E | 2611 | 6.1%
C | 1857 | 4.3%
n | 1857 | 4.3%
r | 1857 | 4.3%
l | 1857 | 4.3%
Other values (4) | 5836 | 13.6%

ProductID
Categorical

HIGH CARDINALITY

Distinct1847
Distinct (%)20.6%
Missing0
Missing (%)0.0%
Memory size70.2 KiB
OFF-PA-10001970
 
18
TEC-AC-10003832
 
17
TEC-AC-10002049
 
15
FUR-CH-10001146
 
15
TEC-AC-10003628
 
14
Other values (1842)
8891 

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters134550
Distinct characters27
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
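As a rough illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to classify the characters of the first ProductID sample value, `FUR-BO-10001798`. The helper name `category_counts` is hypothetical. Scaling the per-ID counts by the 8970 observations assumes every ID follows the same 15-character pattern as the sample; under that assumption the products match the category totals this report lists for ProductID (71760 decimal numbers, 44850 uppercase letters, 17940 dashes).

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lu', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


# First ProductID sample value shown in this report.
sample_id = "FUR-BO-10001798"
per_id = category_counts(sample_id)

# Assumption: every ID has the same layout as the sample, so totals are
# simply the per-ID counts multiplied by the number of observations.
n_rows = 8970
totals = {code: count * n_rows for code, count in per_id.items()}

# Lu = Uppercase Letter, Nd = Decimal Number, Pd = Dash Punctuation
print(per_id)
print(totals)
```

The two-letter codes are the standard abbreviations for the general categories that the report spells out in full.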

Unique

Unique119
Unique (%)1.3%

Sample

1st rowFUR-BO-10001798
2nd rowFUR-CH-10000454
3rd rowOFF-LA-10000240
4th rowFUR-TA-10000577
5th rowOFF-ST-10000760

Common Values

Value | Count | Frequency (%)
OFF-PA-10001970 | 18 | 0.2%
TEC-AC-10003832 | 17 | 0.2%
TEC-AC-10002049 | 15 | 0.2%
FUR-CH-10001146 | 15 | 0.2%
TEC-AC-10003628 | 14 | 0.2%
FUR-CH-10003774 | 14 | 0.2%
FUR-CH-10002647 | 14 | 0.2%
OFF-PA-10002377 | 14 | 0.2%
FUR-CH-10002880 | 14 | 0.2%
FUR-CH-10004287 | 13 | 0.1%
Other values (1837) | 8822 | 98.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
off-pa-1000197018
 
0.2%
tec-ac-1000383217
 
0.2%
tec-ac-1000204915
 
0.2%
fur-ch-1000114615
 
0.2%
tec-ac-1000362814
 
0.2%
fur-ch-1000377414
 
0.2%
fur-ch-1000264714
 
0.2%
off-pa-1000237714
 
0.2%
fur-ch-1000288014
 
0.2%
tec-ac-1000303813
 
0.1%
Other values (1837)8822
98.4%

Most occurring characters

ValueCountFrequency (%)
031448
23.4%
-17940
13.3%
F13456
10.0%
113451
10.0%
O5532
 
4.1%
34355
 
3.2%
24342
 
3.2%
44330
 
3.2%
A4248
 
3.2%
C3213
 
2.4%
Other values (17)32235
24.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number71760
53.3%
Uppercase Letter44850
33.3%
Dash Punctuation17940
 
13.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
F13456
30.0%
O5532
12.3%
A4248
 
9.5%
C3213
 
7.2%
T2928
 
6.5%
U2920
 
6.5%
R2715
 
6.1%
P2616
 
5.8%
E2040
 
4.5%
H1480
 
3.3%
Other values (6)3702
 
8.3%
Decimal Number
ValueCountFrequency (%)
031448
43.8%
113451
18.7%
34355
 
6.1%
24342
 
6.1%
44330
 
6.0%
53098
 
4.3%
72760
 
3.8%
92720
 
3.8%
62662
 
3.7%
82594
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
-17940
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common89700
66.7%
Latin44850
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
F13456
30.0%
O5532
12.3%
A4248
 
9.5%
C3213
 
7.2%
T2928
 
6.5%
U2920
 
6.5%
R2715
 
6.1%
P2616
 
5.8%
E2040
 
4.5%
H1480
 
3.3%
Other values (6)3702
 
8.3%
Common
ValueCountFrequency (%)
031448
35.1%
-17940
20.0%
113451
15.0%
34355
 
4.9%
24342
 
4.8%
44330
 
4.8%
53098
 
3.5%
72760
 
3.1%
92720
 
3.0%
62662
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII134550
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
031448
23.4%
-17940
13.3%
F13456
10.0%
113451
10.0%
O5532
 
4.1%
34355
 
3.2%
24342
 
3.2%
44330
 
3.2%
A4248
 
3.2%
C3213
 
2.4%
Other values (17)32235
24.0%

Category
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size70.2 KiB
Office Supplies
5257 
Furniture
1927 
Technology
1786 

Length

Max length15
Median length15
Mean length12.7154961
Min length9

Characters and Unicode

Total characters114058
Distinct characters20
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowFurniture
2nd rowFurniture
3rd rowOffice Supplies
4th rowFurniture
5th rowOffice Supplies

Common Values

Value | Count | Frequency (%)
Office Supplies | 5257 | 58.6%
Furniture | 1927 | 21.5%
Technology | 1786 | 19.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
office5257
37.0%
supplies5257
37.0%
furniture1927
 
13.5%
technology1786
 
12.6%

Most occurring characters

ValueCountFrequency (%)
e14227
12.5%
i12441
10.9%
p10514
 
9.2%
f10514
 
9.2%
u9111
 
8.0%
c7043
 
6.2%
l7043
 
6.2%
O5257
 
4.6%
s5257
 
4.6%
S5257
 
4.6%
Other values (10)27394
24.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter94574
82.9%
Uppercase Letter14227
 
12.5%
Space Separator5257
 
4.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e14227
15.0%
i12441
13.2%
p10514
11.1%
f10514
11.1%
u9111
9.6%
c7043
7.4%
l7043
7.4%
s5257
 
5.6%
r3854
 
4.1%
n3713
 
3.9%
Other values (5)10857
11.5%
Uppercase Letter
ValueCountFrequency (%)
O5257
37.0%
S5257
37.0%
F1927
 
13.5%
T1786
 
12.6%
Space Separator
ValueCountFrequency (%)
5257
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin108801
95.4%
Common5257
 
4.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e14227
13.1%
i12441
11.4%
p10514
9.7%
f10514
9.7%
u9111
8.4%
c7043
 
6.5%
l7043
 
6.5%
O5257
 
4.8%
s5257
 
4.8%
S5257
 
4.8%
Other values (9)22137
20.3%
Common
ValueCountFrequency (%)
5257
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII114058
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e14227
12.5%
i12441
10.9%
p10514
 
9.2%
f10514
 
9.2%
u9111
 
8.0%
c7043
 
6.2%
l7043
 
6.2%
O5257
 
4.6%
s5257
 
4.6%
S5257
 
4.6%
Other values (10)27394
24.0%

SubCategory
Categorical

HIGH CORRELATION

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size70.2 KiB
Paper
1348 
Binders
889 
Phones
875 
Storage
831 
Furnishings
804 
Other values (12)
4223 

Length

Max length11
Median length9
Mean length7.115942029
Min length3

Characters and Unicode

Total characters63830
Distinct characters28
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowBookcases
2nd rowChairs
3rd rowLabels
4th rowTables
5th rowStorage

Common Values

Value | Count | Frequency (%)
Paper | 1348 | 15.0%
Binders | 889 | 9.9%
Phones | 875 | 9.8%
Storage | 831 | 9.3%
Furnishings | 804 | 9.0%
Art | 788 | 8.8%
Accessories | 754 | 8.4%
Chairs | 605 | 6.7%
Appliances | 393 | 4.4%
Labels | 354 | 3.9%
Other values (7) | 1329 | 14.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
paper1348
15.0%
binders889
9.9%
phones875
9.8%
storage831
9.3%
furnishings804
9.0%
art788
8.8%
accessories754
8.4%
chairs605
6.7%
appliances393
 
4.4%
labels354
 
3.9%
Other values (7)1329
14.8%

Most occurring characters

ValueCountFrequency (%)
s8733
13.7%
e7992
12.5%
r6298
 
9.9%
i4595
 
7.2%
a4349
 
6.8%
n4319
 
6.8%
o3196
 
5.0%
p2834
 
4.4%
h2373
 
3.7%
P2223
 
3.5%
Other values (18)16918
26.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter54860
85.9%
Uppercase Letter8970
 
14.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s8733
15.9%
e7992
14.6%
r6298
11.5%
i4595
8.4%
a4349
7.9%
n4319
7.9%
o3196
 
5.8%
p2834
 
5.2%
h2373
 
4.3%
c2197
 
4.0%
Other values (8)7974
14.5%
Uppercase Letter
ValueCountFrequency (%)
P2223
24.8%
A1935
21.6%
B1096
12.2%
S1020
11.4%
F1015
11.3%
C673
 
7.5%
L354
 
3.9%
T311
 
3.5%
E254
 
2.8%
M89
 
1.0%

Most occurring scripts

ValueCountFrequency (%)
Latin63830
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s8733
13.7%
e7992
12.5%
r6298
 
9.9%
i4595
 
7.2%
a4349
 
6.8%
n4319
 
6.8%
o3196
 
5.0%
p2834
 
4.4%
h2373
 
3.7%
P2223
 
3.5%
Other values (18)16918
26.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII63830
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s8733
13.7%
e7992
12.5%
r6298
 
9.9%
i4595
 
7.2%
a4349
 
6.8%
n4319
 
6.8%
o3196
 
5.0%
p2834
 
4.4%
h2373
 
3.7%
P2223
 
3.5%
Other values (18)16918
26.5%

ProductName
Categorical

HIGH CARDINALITY

Distinct1835
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size70.2 KiB
Staple envelope
 
48
Easy-staple paper
 
46
Staples
 
43
Staple remover
 
18
KI Adjustable-Height Table
 
18
Other values (1830)
8797 

Length

Max length127
Median length78
Mean length36.46465998
Min length5

Characters and Unicode

Total characters327088
Distinct characters85
Distinct categories12
Distinct scripts2
Distinct blocks3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121
Unique (%)1.3%

Sample

1st rowBush Somerset Collection Bookcase
2nd rowHon Deluxe Fabric Upholstered Stacking Chairs, Rounded Back
3rd rowSelf-Adhesive Address Labels for Typewriters by Universal
4th rowBretford CR4500 Series Slim Rectangular Table
5th rowEldon Fold 'N Roll Cart System

Common Values

Value | Count | Frequency (%)
Staple envelope | 48 | 0.5%
Easy-staple paper | 46 | 0.5%
Staples | 43 | 0.5%
Staple remover | 18 | 0.2%
KI Adjustable-Height Table | 18 | 0.2%
Staples in misc. colors | 17 | 0.2%
Logitech 910-002974 M325 Wireless Mouse for Web Scrolling | 14 | 0.2%
Situations Contoured Folding Chairs, 4/Set | 14 | 0.2%
Global Wood Trimmed Manager's Task Chair, Khaki | 14 | 0.2%
Global High-Back Leather Tilter, Burgundy | 14 | 0.2%
Other values (1825) | 8724 | 97.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
xerox851
 
1.7%
x641
 
1.3%
558
 
1.1%
with511
 
1.0%
chair461
 
0.9%
avery446
 
0.9%
for445
 
0.9%
black375
 
0.8%
phone367
 
0.7%
file322
 
0.6%
Other values (2781)44818
90.0%

Most occurring characters

ValueCountFrequency (%)
40455
 
12.4%
e29872
 
9.1%
r18343
 
5.6%
o17781
 
5.4%
a17158
 
5.2%
i16150
 
4.9%
l14624
 
4.5%
n13362
 
4.1%
s12830
 
3.9%
t12818
 
3.9%
Other values (75)133695
40.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter210450
64.3%
Uppercase Letter49717
 
15.2%
Space Separator40874
 
12.5%
Decimal Number16864
 
5.2%
Other Punctuation6340
 
1.9%
Dash Punctuation2611
 
0.8%
Final Punctuation66
 
< 0.1%
Close Punctuation56
 
< 0.1%
Open Punctuation56
 
< 0.1%
Math Symbol31
 
< 0.1%
Other values (2)23
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e29872
14.2%
r18343
 
8.7%
o17781
 
8.4%
a17158
 
8.2%
i16150
 
7.7%
l14624
 
6.9%
n13362
 
6.3%
s12830
 
6.1%
t12818
 
6.1%
c7797
 
3.7%
Other values (18)49715
23.6%
Uppercase Letter
ValueCountFrequency (%)
S5667
 
11.4%
C5276
 
10.6%
B4611
 
9.3%
P4308
 
8.7%
M2633
 
5.3%
A2588
 
5.2%
D2554
 
5.1%
T2405
 
4.8%
F2284
 
4.6%
L2011
 
4.0%
Other values (16)15380
30.9%
Other Punctuation
ValueCountFrequency (%)
,2739
43.2%
/1402
22.1%
"1128
17.8%
.429
 
6.8%
&259
 
4.1%
'229
 
3.6%
#89
 
1.4%
%42
 
0.7%
*8
 
0.1%
!5
 
0.1%
Other values (2)10
 
0.2%
Decimal Number
ValueCountFrequency (%)
13486
20.7%
02742
16.3%
22135
12.7%
41617
9.6%
31431
8.5%
51381
 
8.2%
91183
 
7.0%
81169
 
6.9%
6887
 
5.3%
7833
 
4.9%
Space Separator
ValueCountFrequency (%)
40455
99.0%
 419
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
-2611
100.0%
Final Punctuation
ValueCountFrequency (%)
66
100.0%
Close Punctuation
ValueCountFrequency (%)
)56
100.0%
Open Punctuation
ValueCountFrequency (%)
(56
100.0%
Math Symbol
ValueCountFrequency (%)
+31
100.0%
Initial Punctuation
ValueCountFrequency (%)
18
100.0%
Other Number
ValueCountFrequency (%)
¾5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin260167
79.5%
Common66921
 
20.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e29872
 
11.5%
r18343
 
7.1%
o17781
 
6.8%
a17158
 
6.6%
i16150
 
6.2%
l14624
 
5.6%
n13362
 
5.1%
s12830
 
4.9%
t12818
 
4.9%
c7797
 
3.0%
Other values (44)99432
38.2%
Common
ValueCountFrequency (%)
40455
60.5%
13486
 
5.2%
02742
 
4.1%
,2739
 
4.1%
-2611
 
3.9%
22135
 
3.2%
41617
 
2.4%
31431
 
2.1%
/1402
 
2.1%
51381
 
2.1%
Other values (21)6922
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII326566
99.8%
None438
 
0.1%
Punctuation84
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40455
 
12.4%
e29872
 
9.1%
r18343
 
5.6%
o17781
 
5.4%
a17158
 
5.3%
i16150
 
4.9%
l14624
 
4.5%
n13362
 
4.1%
s12830
 
3.9%
t12818
 
3.9%
Other values (69)133173
40.8%
None
ValueCountFrequency (%)
 419
95.7%
é12
 
2.7%
¾5
 
1.1%
à2
 
0.5%
Punctuation
ValueCountFrequency (%)
66
78.6%
18
 
21.4%

Sales
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5150
Distinct (%)57.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean237.502482
Minimum0.9900000095
Maximum22638.48047
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size35.2 KiB

Quantile statistics

Minimum0.9900000095
5-th percentile6.137999868
Q119.44000053
median60.84000015
Q3225.2960052
95-th percentile960.8067932
Maximum22638.48047
Range22637.49047
Interquartile range (IQR)205.8560047

Descriptive statistics

Standard deviation630.8734131
Coefficient of variation (CV)2.656281348
Kurtosis316.3005676
Mean237.502482
Median Absolute Deviation (MAD)50.15200043
Skewness13.28466034
Sum2130397.264
Variance398001.2812
MonotonicityNot monotonic
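Several of the Sales statistics above are simple functions of one another. The sketch below recomputes them from the reported figures as a consistency check; the variable names are illustrative, and the observation count of 8970 comes from the dataset overview. Small differences in the last digits are expected because the report prints rounded values.

```python
# Figures copied from the Sales statistics in this report.
n = 8970                      # number of observations
mean = 237.502482
std = 630.8734131
total = 2130397.264           # Sum
q1, q3 = 19.44000053, 225.2960052
minimum, maximum = 0.9900000095, 22638.48047

cv = std / mean               # coefficient of variation
iqr = q3 - q1                 # interquartile range
value_range = maximum - minimum
mean_from_sum = total / n     # mean recovered from the sum

print(f"CV    = {cv:.6f}")
print(f"IQR   = {iqr:.6f}")
print(f"Range = {value_range:.5f}")
print(f"Mean  = {mean_from_sum:.6f}")
```

A coefficient of variation well above 1, together with the large skewness, is consistent with a long right tail: most sales are small and a few are very large.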
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
12.96000004 | 56 | 0.6%
15.55200005 | 39 | 0.4%
19.44000053 | 39 | 0.4%
25.92000008 | 36 | 0.4%
10.36800003 | 36 | 0.4%
32.40000153 | 28 | 0.3%
17.94000053 | 21 | 0.2%
6.480000019 | 20 | 0.2%
20.73600006 | 19 | 0.2%
14.93999958 | 17 | 0.2%
Other values (5140) | 8659 | 96.5%
Value | Count | Frequency (%)
0.9900000095 | 1 | < 0.1%
1.24000001 | 1 | < 0.1%
1.343999982 | 3 | < 0.1%
1.407999992 | 1 | < 0.1%
1.440000057 | 1 | < 0.1%
1.447999954 | 1 | < 0.1%
1.503999949 | 1 | < 0.1%
1.583999991 | 1 | < 0.1%
1.631999969 | 1 | < 0.1%
1.639999986 | 1 | < 0.1%
Value | Count | Frequency (%)
22638.48047 | 1 | < 0.1%
17499.94922 | 1 | < 0.1%
13999.95996 | 1 | < 0.1%
11199.96777 | 1 | < 0.1%
10499.96973 | 1 | < 0.1%
9449.950195 | 1 | < 0.1%
9099.929688 | 1 | < 0.1%
8749.950195 | 1 | < 0.1%
8399.975586 | 1 | < 0.1%
8187.649902 | 1 | < 0.1%

Quantity
Real number (ℝ≥0)

HIGH CORRELATION

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.63690078
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size35.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile8
Maximum9
Range8
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.971940128
Coefficient of variation (CV)0.5422034439
Kurtosis0.1128219995
Mean3.63690078
Median Absolute Deviation (MAD)1
Skewness0.8725743112
Sum32623
Variance3.88854787
MonotonicityNot monotonic
Histogram with fixed size bins (bins=9)
Value | Count | Frequency (%)
3 | 2218 | 24.7%
2 | 2207 | 24.6%
5 | 1111 | 12.4%
4 | 1086 | 12.1%
1 | 827 | 9.2%
7 | 545 | 6.1%
6 | 514 | 5.7%
9 | 234 | 2.6%
8 | 228 | 2.5%
Value | Count | Frequency (%)
1 | 827 | 9.2%
2 | 2207 | 24.6%
3 | 2218 | 24.7%
4 | 1086 | 12.1%
5 | 1111 | 12.4%
6 | 514 | 5.7%
7 | 545 | 6.1%
8 | 228 | 2.5%
9 | 234 | 2.6%
Value | Count | Frequency (%)
9 | 234 | 2.6%
8 | 228 | 2.5%
7 | 545 | 6.1%
6 | 514 | 5.7%
5 | 1111 | 12.4%
4 | 1086 | 12.1%
3 | 2218 | 24.7%
2 | 2207 | 24.6%
1 | 827 | 9.2%

Discount
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1034860663
Minimum0
Maximum0.5
Zeros4712
Zeros (%)52.5%
Negative0
Negative (%)0.0%
Memory size35.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.200000003
95-th percentile0.3000000119
Maximum0.5
Range0.5
Interquartile range (IQR)0.200000003

Descriptive statistics

Standard deviation0.1171929389
Coefficient of variation (CV)1.132451383
Kurtosis-0.2922693789
Mean0.1034860663
Median Absolute Deviation (MAD)0
Skewness0.6763803959
Sum928.2700147
Variance0.01373418421
MonotonicityNot monotonic
Histogram with fixed size bins (bins=9)
Value | Count | Frequency (%)
0 | 4712 | 52.5%
0.200000003 | 3592 | 40.0%
0.3000000119 | 224 | 2.5%
0.400000006 | 200 | 2.2%
0.1000000015 | 89 | 1.0%
0.5 | 66 | 0.7%
0.150000006 | 50 | 0.6%
0.3199999928 | 26 | 0.3%
0.4499999881 | 11 | 0.1%
Value | Count | Frequency (%)
0 | 4712 | 52.5%
0.1000000015 | 89 | 1.0%
0.150000006 | 50 | 0.6%
0.200000003 | 3592 | 40.0%
0.3000000119 | 224 | 2.5%
0.3199999928 | 26 | 0.3%
0.400000006 | 200 | 2.2%
0.4499999881 | 11 | 0.1%
0.5 | 66 | 0.7%
Value | Count | Frequency (%)
0.5 | 66 | 0.7%
0.4499999881 | 11 | 0.1%
0.400000006 | 200 | 2.2%
0.3199999928 | 26 | 0.3%
0.3000000119 | 224 | 2.5%
0.200000003 | 3592 | 40.0%
0.150000006 | 50 | 0.6%
0.1000000015 | 89 | 1.0%
0 | 4712 | 52.5%

Profit
Real number (ℝ)

HIGH CORRELATION

Distinct6372
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38.43068337
Minimum-3839.990479
Maximum8399.975586
Zeros64
Zeros (%)0.7%
Negative1001
Negative (%)11.2%
Memory size35.2 KiB

Quantile statistics

Minimum-3839.990479
5-th percentile-31.75479994
Q13.235199928
median9.920800209
Q332.65540028
95-th percentile176.6062248
Maximum8399.975586
Range12239.96606
Interquartile range (IQR)29.42020035

Descriptive statistics

Standard deviation208.543335
Coefficient of variation (CV)5.426480007
Kurtosis523.6069336
Mean38.43068337
Median Absolute Deviation (MAD)9.044400215
Skewness16.22002792
Sum344723.2298
Variance43490.32031
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 64 | 0.7%
6.220799923 | 43 | 0.5%
9.331199646 | 38 | 0.4%
5.443200111 | 32 | 0.4%
3.628799915 | 32 | 0.4%
15.55200005 | 26 | 0.3%
12.44159985 | 21 | 0.2%
7.257599831 | 19 | 0.2%
3.110399961 | 18 | 0.2%
9.07199955 | 11 | 0.1%
Other values (6362) | 8666 | 96.6%
Value | Count | Frequency (%)
-3839.990479 | 1 | < 0.1%
-1811.078369 | 1 | < 0.1%
-1665.052246 | 1 | < 0.1%
-1359.991943 | 1 | < 0.1%
-1049.340576 | 1 | < 0.1%
-1002.78363 | 1 | < 0.1%
-968.8833008 | 1 | < 0.1%
-944.9946289 | 1 | < 0.1%
-814.4832153 | 1 | < 0.1%
-786.0144043 | 1 | < 0.1%
Value | Count | Frequency (%)
8399.975586 | 1 | < 0.1%
6719.980957 | 1 | < 0.1%
5039.98584 | 1 | < 0.1%
4630.475586 | 1 | < 0.1%
3919.98877 | 1 | < 0.1%
3177.475098 | 1 | < 0.1%
2799.983887 | 1 | < 0.1%
2591.956787 | 1 | < 0.1%
2504.22168 | 1 | < 0.1%
2400.96582 | 1 | < 0.1%

Interactions

Correlations

Auto

The auto setting is an easily interpretable pairwise column metric that picks a method from the types of the two variables (vartype-vartype: method): categorical-categorical: Cramér's V; numerical-categorical: Cramér's V (using a discretized numerical column); numerical-numerical: Spearman's ρ. This configuration uses the most suitable method for each pair of columns.
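The mapping can be pictured as a small lookup on the pair of variable types. The sketch below is only an illustration of that mapping: the function name `auto_method` and the type labels `"numerical"` and `"categorical"` are hypothetical and are not taken from the reporting library.

```python
def auto_method(type_a: str, type_b: str) -> str:
    """Name the measure the auto setting maps a pair of variable types to."""
    kinds = {type_a, type_b}
    if not kinds <= {"numerical", "categorical"}:
        raise ValueError(f"unsupported variable types: {type_a!r}, {type_b!r}")
    if kinds == {"numerical"}:
        return "Spearman's rho"
    if kinds == {"categorical"}:
        return "Cramér's V"
    # One numerical and one categorical column: the numerical one is
    # discretized first, then treated as categorical.
    return "Cramér's V (discretized numerical column)"


# Column pairs flagged as highly correlated in this report.
print(auto_method("numerical", "numerical"))      # e.g. Sales vs Profit
print(auto_method("categorical", "categorical"))  # e.g. Category vs SubCategory
print(auto_method("numerical", "categorical"))    # e.g. Discount vs State
```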

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
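A minimal pure-Python sketch of that calculation follows. The helper names are illustrative, and ties receive the average of their positions, which is a common convention assumed here rather than something stated in the report. The toy data `y = x**3` is monotonic but not linear, so ρ reaches 1 while Pearson's r stays below 1.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Covariance divided by the product of standard deviations.

    The 1/n (or 1/(n-1)) factors cancel, so plain sums are used.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))


x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic, nonlinear toy data

rho = spearman_rho(x, y)
r = pearson(x, y)
print(f"rho = {rho:.4f}, r = {r:.4f}")
```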

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
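The sketch below applies that definition to a small made-up sample and then checks the invariance property by shifting and rescaling each variable separately with positive scale factors. The data and the function name are illustrative only.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sx = sqrt(sum((a - mx) ** 2 for a in x) / n)
    sy = sqrt(sum((b - my) ** 2 for b in y) / n)
    return cov / (sx * sy)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 6.0]

r = pearson_r(x, y)

# Change location and scale of each variable independently.
x_shifted = [2.0 * v + 10.0 for v in x]
y_shifted = [0.5 * v - 3.0 for v in y]
r_shifted = pearson_r(x_shifted, y_shifted)

print(f"r = {r:.6f}, after rescaling = {r_shifted:.6f}")
```

A negative scale factor on one variable would flip the sign of r while leaving its magnitude unchanged.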

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
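A direct, quadratic-time sketch of the pair-counting rule follows. Tied pairs count as neither concordant nor discordant, which corresponds to the tau-a variant; library implementations often apply a tie correction instead, so this is an illustration of the description rather than of any particular library's method.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, counting every pair once."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1   # both variables move in the same direction
        elif sign < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / len(pairs)


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair out of six

tau = kendall_tau(x, y)
print(f"tau = {tau:.4f}")
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = 4/6.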

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
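The following sketch computes a bias-corrected Cramér's V in the form commonly attributed to Bergsma (2013): the chi-squared statistic is divided by n, a correction term is subtracted, and the row and column counts are adjusted before normalizing. The function name and the toy data are illustrative; the toy labels borrow Category and SubCategory names from this report, but the counts are made up.

```python
from collections import Counter
from math import sqrt


def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two equally long sequences of labels."""
    n = len(a)
    row_totals = Counter(a)
    col_totals = Counter(b)
    observed = Counter(zip(a, b))

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for row, row_total in row_totals.items():
        for col, col_total in col_totals.items():
            expected = row_total * col_total / n
            chi2 += (observed[(row, col)] - expected) ** 2 / expected

    r, c = len(row_totals), len(col_totals)
    phi2_corr = max(0.0, chi2 / n - (r - 1) * (c - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    c_corr = c - (c - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, c_corr - 1)
    return sqrt(phi2_corr / denom) if denom > 0 else 0.0


# Toy data: each sub-category belongs to exactly one category.
parent = {"Paper": "Office Supplies", "Binders": "Office Supplies",
          "Phones": "Technology", "Chairs": "Furniture"}
sub = ["Paper", "Binders", "Phones", "Chairs"] * 5
cat = [parent[s] for s in sub]
v_dependent = cramers_v_corrected(cat, sub)

# Toy data: two labels that are independent by construction.
left = ["x", "x", "y", "y"] * 5
right = ["p", "q", "p", "q"] * 5
v_independent = cramers_v_corrected(left, right)

print(f"dependent: {v_dependent:.3f}, independent: {v_independent:.3f}")
```

Because of the correction, a small sample with perfectly nested labels yields a value slightly below 1 rather than exactly 1.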

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.

Sample

First rows

(index) | df_index | RowID | OrderID | OrderDate | ShipDate | ShipMode | CustomerID | CustomerName | Segment | Country | City | State | PostalCode | Region | ProductID | Category | SubCategory | ProductName | Sales | Quantity | Discount | Profit
0 | 0 | 1 | CA-2016-152156 | 11/8/2016 | 11/11/2016 | Second Class | CG-12520 | Claire Gute | Consumer | United States | Henderson | Kentucky | 42420 | South | FUR-BO-10001798 | Furniture | Bookcases | Bush Somerset Collection Bookcase | 261.959991 | 2 | 0.00 | 41.913601
1 | 1 | 2 | CA-2016-152156 | 11/8/2016 | 11/11/2016 | Second Class | CG-12520 | Claire Gute | Consumer | United States | Henderson | Kentucky | 42420 | South | FUR-CH-10000454 | Furniture | Chairs | Hon Deluxe Fabric Upholstered Stacking Chairs, Rounded Back | 731.940002 | 3 | 0.00 | 219.582001
2 | 2 | 3 | CA-2016-138688 | 6/12/2016 | 6/16/2016 | Second Class | DV-13045 | Darrin Van Huff | Corporate | United States | Los Angeles | California | 90036 | West | OFF-LA-10000240 | Office Supplies | Labels | Self-Adhesive Address Labels for Typewriters by Universal | 14.620000 | 2 | 0.00 | 6.871400
3 | 3 | 4 | US-2015-108966 | 10/11/2015 | 10/18/2015 | Standard Class | SO-20335 | Sean O'Donnell | Consumer | United States | Fort Lauderdale | Florida | 33311 | South | FUR-TA-10000577 | Furniture | Tables | Bretford CR4500 Series Slim Rectangular Table | 957.577515 | 5 | 0.45 | -383.031006
4 | 4 | 5 | US-2015-108966 | 10/11/2015 | 10/18/2015 | Standard Class | SO-20335 | Sean O'Donnell | Consumer | United States | Fort Lauderdale | Florida | 33311 | South | OFF-ST-10000760 | Office Supplies | Storage | Eldon Fold 'N Roll Cart System | 22.368000 | 2 | 0.20 | 2.516400
5 | 5 | 6 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | FUR-FU-10001487 | Furniture | Furnishings | Eldon Expressions Wood and Plastic Desk Accessories, Cherry Wood | 48.860001 | 7 | 0.00 | 14.169400
6 | 6 | 7 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-AR-10002833 | Office Supplies | Art | Newell 322 | 7.280000 | 4 | 0.00 | 1.965600
7 | 7 | 8 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | TEC-PH-10002275 | Technology | Phones | Mitel 5320 IP Phone VoIP phone | 907.151978 | 6 | 0.20 | 90.715202
8 | 8 | 9 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-BI-10003910 | Office Supplies | Binders | DXL Angle-View Binders with Locking Rings by Samsill | 18.504000 | 3 | 0.20 | 5.782500
9 | 9 | 10 | CA-2014-115812 | 6/9/2014 | 6/14/2014 | Standard Class | BH-11710 | Brosina Hoffman | Consumer | United States | Los Angeles | California | 90032 | West | OFF-AP-10002892 | Office Supplies | Appliances | Belkin F5C206VTEL 6 Outlet Surge | 114.900002 | 5 | 0.00 | 34.470001

Last rows

(index) | df_index | RowID | OrderID | OrderDate | ShipDate | ShipMode | CustomerID | CustomerName | Segment | Country | City | State | PostalCode | Region | ProductID | Category | SubCategory | ProductName | Sales | Quantity | Discount | Profit
8960 | 9983 | 9984 | US-2016-157728 | 9/22/2016 | 9/28/2016 | Standard Class | RC-19960 | Ryan Crowe | Consumer | United States | Grand Rapids | Michigan | 49505 | Central | TEC-PH-10001305 | Technology | Phones | Panasonic KX TS208W Corded phone | 97.980003 | 2 | 0.0 | 27.434401
8961 | 9985 | 9986 | CA-2015-100251 | 5/17/2015 | 5/23/2015 | Standard Class | DV-13465 | Dianna Vittorini | Consumer | United States | Long Beach | New York | 11561 | East | OFF-SU-10000898 | Office Supplies | Supplies | Acme Hot Forged Carbon Steel Scissors with Nickel-Plated Handles, 3 7/8" Cut, 8"L | 55.599998 | 4 | 0.0 | 16.124001
8962 | 9986 | 9987 | CA-2016-125794 | 9/29/2016 | 10/3/2016 | Standard Class | ML-17410 | Maris LaWare | Consumer | United States | Los Angeles | California | 90008 | West | TEC-AC-10003399 | Technology | Accessories | Memorex Mini Travel Drive 64 GB USB 2.0 Flash Drive | 36.240002 | 1 | 0.0 | 15.220800
8963 | 9987 | 9988 | CA-2017-163629 | 11/17/2017 | 11/21/2017 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-AC-10001539 | Technology | Accessories | Logitech G430 Surround Sound Gaming Headset with Dolby 7.1 Technology | 79.989998 | 1 | 0.0 | 28.796400
8964 | 9988 | 9989 | CA-2017-163629 | 11/17/2017 | 11/21/2017 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-PH-10004006 | Technology | Phones | Panasonic KX - TS880B Telephone | 206.100006 | 5 | 0.0 | 55.646999
8965 | 9989 | 9990 | CA-2014-110422 | 1/21/2014 | 1/23/2014 | Second Class | TB-21400 | Tom Boeckenhauer | Consumer | United States | Miami | Florida | 33180 | South | FUR-FU-10001889 | Furniture | Furnishings | Ultra Door Pull Handle | 25.247999 | 3 | 0.2 | 4.102800
8966 | 9990 | 9991 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | FUR-FU-10000747 | Furniture | Furnishings | Tenex B1-RE Series Chair Mats for Low Pile Carpets | 91.959999 | 2 | 0.0 | 15.633200
8967 | 9991 | 9992 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | TEC-PH-10003645 | Technology | Phones | Aastra 57i VoIP phone | 258.575989 | 2 | 0.2 | 19.393200
8968 | 9992 | 9993 | CA-2017-121258 | 2/26/2017 | 3/3/2017 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | OFF-PA-10004041 | Office Supplies | Paper | It's Hot Message Books with Stickers, 2 3/4" x 5" | 29.600000 | 4 | 0.0 | 13.320000
8969 | 9993 | 9994 | CA-2017-119914 | 5/4/2017 | 5/9/2017 | Second Class | CC-12220 | Chris Cortes | Consumer | United States | Westminster | California | 92683 | West | OFF-AP-10002684 | Office Supplies | Appliances | Acco 7-Outlet Masterpiece Power Center, Wihtout Fax/Phone Line Protection | 243.160004 | 2 | 0.0 | 72.947998